Exploratory Plots for 2017-2018 Acoustic/Fish Data
Purpose
Explore the acoustic data gathered in 2017 and 2018 to expose important trends between sites, diurnal patterns, fish abundance, lunar phase, and coral reef acoustics.
Combined Model
All variables are matched to the files that were used for the fish call counts (recordings at 3:00, 9:00, 15:00, and 21:00).
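A minimal sketch of that matching step, assuming the acoustic table has one row per recording with an hour-of-day column. The data frame `acoustic` and its `Site` and `Hour` columns are hypothetical stand-ins, not names from the original analysis.

```r
set.seed(1)

# Hypothetical acoustic table: one row per recording file (names are assumptions).
acoustic <- data.frame(
  Site = rep(c(5, 8), each = 24),
  Hour = rep(0:23, times = 2),
  SPL_Midrange = 105 + rnorm(48)
)

# Hours at which fish calls were counted, taken from the text above.
call_hours <- c(3, 9, 15, 21)

# Keep only the recordings that line up with a fish call count.
matched <- acoustic[acoustic$Hour %in% call_hours, ]
table(matched$Site, matched$Hour)
```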
## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
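The `stat_bin()` message goes away once a bin width is set explicitly. The sketch below picks one with the Freedman-Diaconis rule on simulated values; the vector `spl` is a made-up stand-in for whichever column is being plotted, and the choice of rule is an assumption rather than something the original analysis used.

```r
set.seed(1)
spl <- rnorm(200, mean = 105, sd = 3)  # simulated stand-in for an SPL column

# Freedman-Diaconis rule: bin width = 2 * IQR / n^(1/3)
bw <- 2 * IQR(spl) / length(spl)^(1/3)

if (requireNamespace("ggplot2", quietly = TRUE)) {
  # Passing binwidth silences the "Pick better value" message.
  p <- ggplot2::ggplot(data.frame(spl = spl), ggplot2::aes(x = spl)) +
    ggplot2::geom_histogram(binwidth = bw)
  print(p)
} else {
  # Base R fallback using the same bin width.
  hist(spl, breaks = seq(min(spl), max(spl) + bw, by = bw))
}
```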
Plotting the explanatory variables (Knocks, Calls, Herbivory, Snaps) against the response variables (MF and HF SPL)
Breaking down the relationship between total knocks and MF to the site and hour level
Running basic regressions linking the explanatory variables to the SPL response, both combined and at their lowest levels, to see how different sites and hours change the regression
The linear model output is shown below each plot
##
## Call:
## lm(formula = SPL_HF ~ Snaps, data = Snap.HF17)
##
## Residuals:
## Min 1Q Median 3Q Max
## -7.8309 -1.9842 0.2062 1.8451 13.3944
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 1.053e+02 6.541e-01 160.99 <2e-16 ***
## Snaps 7.227e-03 4.475e-04 16.15 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 2.807 on 10163 degrees of freedom
## Multiple R-squared: 0.02502, Adjusted R-squared: 0.02493
## F-statistic: 260.8 on 1 and 10163 DF, p-value: < 2.2e-16
2017 Snap data, snaps significant, although the model explains little of the variance (R-squared = 0.025).
##
## Call:
## lm(formula = SPL_HF ~ Snaps, data = Snap.HF18)
##
## Residuals:
## Min 1Q Median 3Q Max
## -9.4682 -1.9696 0.0058 2.4042 30.2074
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 8.617e+01 8.999e-01 95.75 <2e-16 ***
## Snaps 2.269e-02 6.168e-04 36.78 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 3.142 on 5823 degrees of freedom
## Multiple R-squared: 0.1886, Adjusted R-squared: 0.1884
## F-statistic: 1353 on 1 and 5823 DF, p-value: < 2.2e-16
2018 Snap data with outliers removed. Snaps significant (R-squared = 0.19).
##
## Call:
## lm(formula = SPL_Midrange ~ Tot_Knocks, data = AC.DF1)
##
## Residuals:
## Min 1Q Median 3Q Max
## -7.248 -2.267 -0.871 1.597 19.211
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 1.047e+02 3.888e-01 269.163 < 2e-16 ***
## Tot_Knocks 1.744e-02 4.465e-03 3.906 0.000129 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 3.519 on 198 degrees of freedom
## Multiple R-squared: 0.07154, Adjusted R-squared: 0.06685
## F-statistic: 15.26 on 1 and 198 DF, p-value: 0.0001287
Combined 2017-2018 data with 200 samples. The first plot splits by site and the second by hour, to show any patterns before I break them down individually.
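The site and hour breakdowns repeat one regression on subsets of the data, so they can also be run in a single pass. This is only a sketch on simulated data: the `Site` and `Hour` column names and the helper `slope_by()` are assumptions, and the simulated values are not the study data.

```r
set.seed(42)

# Simulated stand-in for AC.DF1: 5 sites x 4 hours x 10 samples = 200 rows.
# The Site and Hour column names are assumptions.
demo <- expand.grid(sample_id = 1:10,
                    Hour = c(3, 9, 15, 21),
                    Site = c(5, 8, 32, 35, 40))
demo$Tot_Knocks <- rpois(nrow(demo), lambda = 60)
demo$SPL_Midrange <- 104 + 0.02 * demo$Tot_Knocks + rnorm(nrow(demo), sd = 3)

# Fit the same regression within each level of a grouping column and
# return the Tot_Knocks row of each coefficient table.
slope_by <- function(data, group) {
  fits <- lapply(split(data, data[[group]]), function(d) {
    lm(SPL_Midrange ~ Tot_Knocks, data = d)
  })
  t(sapply(fits, function(f) summary(f)$coefficients["Tot_Knocks", ]))
}

slope_by(demo, "Site")  # one row per site
slope_by(demo, "Hour")  # one row per hour
```

Each row holds the estimate, standard error, t value, and p-value for the knock slope, which makes the groups easier to compare side by side than separate `summary()` printouts.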
Breakdown by Site
##
## Call:
## lm(formula = SPL_Midrange ~ Tot_Knocks, data = s5)
##
## Residuals:
## Min 1Q Median 3Q Max
## -4.4846 -2.3049 0.1011 1.9482 6.0310
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 1.065e+02 1.106e+00 96.290 <2e-16 ***
## Tot_Knocks 5.551e-04 8.256e-03 0.067 0.947
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 3.034 on 38 degrees of freedom
## Multiple R-squared: 0.0001189, Adjusted R-squared: -0.02619
## F-statistic: 0.00452 on 1 and 38 DF, p-value: 0.9467
Site 5, knocks not significant.
##
## Call:
## lm(formula = SPL_Midrange ~ Tot_Knocks, data = s35)
##
## Residuals:
## Min 1Q Median 3Q Max
## -10.1201 -3.6626 0.4059 4.2686 9.1758
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 105.01636 1.19662 87.761 <2e-16 ***
## Tot_Knocks 0.03231 0.01218 2.653 0.0116 *
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 4.804 on 38 degrees of freedom
## Multiple R-squared: 0.1563, Adjusted R-squared: 0.1341
## F-statistic: 7.039 on 1 and 38 DF, p-value: 0.01157
Site 35, knocks significant.
##
## Call:
## lm(formula = SPL_Midrange ~ Tot_Knocks, data = s8)
##
## Residuals:
## Min 1Q Median 3Q Max
## -4.5526 -1.5016 0.6098 1.8588 6.6098
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 105.497101 0.700474 150.61 <2e-16 ***
## Tot_Knocks -0.006653 0.009929 -0.67 0.507
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 2.727 on 38 degrees of freedom
## Multiple R-squared: 0.01168, Adjusted R-squared: -0.01433
## F-statistic: 0.449 on 1 and 38 DF, p-value: 0.5068
Site 8, knocks not significant. Negative relationship… that's interesting, though the slope is not distinguishable from zero (p = 0.507).
##
## Call:
## lm(formula = SPL_Midrange ~ Tot_Knocks, data = s40)
##
## Residuals:
## Min 1Q Median 3Q Max
## -2.2090 -0.9792 -0.3831 0.7009 4.7409
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 1.041e+02 4.407e-01 236.176 <2e-16 ***
## Tot_Knocks 6.514e-03 8.094e-03 0.805 0.426
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 1.554 on 38 degrees of freedom
## Multiple R-squared: 0.01676, Adjusted R-squared: -0.009116
## F-statistic: 0.6477 on 1 and 38 DF, p-value: 0.4259
Site 40, knocks not significant.
##
## Call:
## lm(formula = SPL_Midrange ~ Tot_Knocks, data = s32)
##
## Residuals:
## Min 1Q Median 3Q Max
## -4.0442 -1.9728 -0.7078 0.0613 18.4340
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 103.79253 0.92602 112.084 <2e-16 ***
## Tot_Knocks 0.04784 0.01903 2.514 0.0163 *
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 3.783 on 38 degrees of freedom
## Multiple R-squared: 0.1426, Adjusted R-squared: 0.12
## F-statistic: 6.321 on 1 and 38 DF, p-value: 0.01629
Site 32, knocks significant.
Breakdown by Hour
##
## Call:
## lm(formula = SPL_Midrange ~ Tot_Knocks, data = h3)
##
## Residuals:
## Min 1Q Median 3Q Max
## -4.8821 -2.3813 -0.5447 2.0264 6.8553
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 1.043e+02 7.055e-01 147.893 <2e-16 ***
## Tot_Knocks 5.296e-03 7.304e-03 0.725 0.472
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 3.121 on 48 degrees of freedom
## Multiple R-squared: 0.01083, Adjusted R-squared: -0.009773
## F-statistic: 0.5258 on 1 and 48 DF, p-value: 0.4719
3AM, knocks not significant.
##
## Call:
## lm(formula = SPL_Midrange ~ Tot_Knocks, data = h9)
##
## Residuals:
## Min 1Q Median 3Q Max
## -4.3144 -1.6662 -0.4952 0.7818 8.0555
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 102.90908 0.61924 166.186 < 2e-16 ***
## Tot_Knocks 0.05274 0.00653 8.076 1.69e-10 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 2.703 on 48 degrees of freedom
## Multiple R-squared: 0.5761, Adjusted R-squared: 0.5672
## F-statistic: 65.22 on 1 and 48 DF, p-value: 1.69e-10
9AM, knocks significant.
##
## Call:
## lm(formula = SPL_Midrange ~ Tot_Knocks, data = h15)
##
## Residuals:
## Min 1Q Median 3Q Max
## -4.0816 -2.0625 -0.9435 1.2227 7.1127
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 105.697228 0.669185 157.949 <2e-16 ***
## Tot_Knocks -0.006816 0.011700 -0.583 0.563
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 3.128 on 48 degrees of freedom
## Multiple R-squared: 0.007021, Adjusted R-squared: -0.01367
## F-statistic: 0.3394 on 1 and 48 DF, p-value: 0.5629
3PM, knocks not significant.
##
## Call:
## lm(formula = SPL_Midrange ~ Tot_Knocks, data = h21)
##
## Residuals:
## Min 1Q Median 3Q Max
## -4.987 -2.505 -0.915 1.457 18.595
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 1.060e+02 9.217e-01 114.979 <2e-16 ***
## Tot_Knocks 4.355e-03 9.860e-03 0.442 0.661
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 3.91 on 48 degrees of freedom
## Multiple R-squared: 0.004048, Adjusted R-squared: -0.0167
## F-statistic: 0.1951 on 1 and 48 DF, p-value: 0.6607
9PM, knocks not significant.
Summary
Knocks significantly explained SPL MF at sites 35 and 32 and at 9AM.
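Because nine separate regressions were run (five sites and four hours), one optional sanity check is to adjust the slope p-values for multiple comparisons. The sketch below copies the `Tot_Knocks` p-values from the outputs above and applies a Holm adjustment; the choice of Holm is an assumption, and this check was not part of the original analysis.

```r
# Tot_Knocks p-values copied from the nine model summaries above.
p_raw <- c(site5 = 0.947, site35 = 0.0116, site8 = 0.507, site40 = 0.426,
           site32 = 0.0163, h3 = 0.472, h9 = 1.69e-10, h15 = 0.563, h21 = 0.661)

# Holm adjustment controls the family-wise error rate across all nine tests.
p_holm <- p.adjust(p_raw, method = "holm")

round(cbind(raw = p_raw, holm = p_holm), 4)
```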
Running basic regressions of SPL on wind speed, at both HF and MF, to see whether wind speed significantly affects the sound
## Warning: Removed 1518 rows containing non-finite values (stat_smooth).
## Warning: Removed 1518 rows containing missing values (geom_point).
## Warning: Removed 1520 rows containing non-finite values (stat_smooth).
## Warning: Removed 1520 rows containing missing values (geom_point).
Wind doesn’t seem to impact SPL HF or MF in any particular direction, although the range of wind speeds seems really small.
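The warnings above come from rows with missing wind or SPL values being dropped before plotting. A quick way to see how many rows a regression actually uses, and how narrow the wind range is, is sketched below on simulated data; `wind_df` and the `Wind_Speed` column are hypothetical names.

```r
set.seed(7)

# Simulated stand-in; Wind_Speed is a hypothetical column name.
wind_df <- data.frame(
  Wind_Speed = runif(500, min = 2, max = 6),
  SPL_HF = rnorm(500, mean = 110, sd = 3)
)
wind_df$Wind_Speed[sample(500, 50)] <- NA  # mimic rows with no wind reading

# How many rows would be dropped, and how wide is the observed wind range?
n_missing <- sum(!is.finite(wind_df$Wind_Speed) | !is.finite(wind_df$SPL_HF))
wind_range <- range(wind_df$Wind_Speed, na.rm = TRUE)

# lm() drops incomplete rows under the default na.action (na.omit).
wind_fit <- lm(SPL_HF ~ Wind_Speed, data = wind_df)
c(missing = n_missing, used = nobs(wind_fit))
wind_range
```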
Acoustics Breakdown
All acoustic metrics (SPL and ACI) are broken down into 3 frequency bands: Broadband (all frequencies), High Frequency (1 kHz to 22 kHz), and Mid Frequency (160 Hz to 1 kHz).
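As a small illustration of the band limits, the helper below labels a frequency in Hz with its band. The function name `band_of()` is hypothetical, and putting exactly 1 kHz in the high band is an assumption, since the text does not say which band owns the boundary.

```r
# Label frequencies (Hz) with the Mid or High Frequency band.
# Assumption: intervals are [160, 1000) and [1000, 22000].
band_of <- function(freq_hz) {
  cut(freq_hz,
      breaks = c(160, 1000, 22000),
      labels = c("Mid Frequency", "High Frequency"),
      right = FALSE, include.lowest = TRUE)
}

# Values outside 160 Hz to 22 kHz fall only in the broadband range and give NA.
band_of(c(100, 160, 500, 1000, 5000, 22000))
```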
Note: 2017 had a 10 minute duty cycle with 5 minutes of recording, while 2018 had a 15 minute duty cycle with 5 minutes of recording, so the number of files averaged differs between years.
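The arithmetic behind that note can be written out as follows, assuming one file per duty cycle and hourly averaging (both are assumptions, since the averaging window is not stated here).

```r
# Duty cycle settings from the note above.
duty <- data.frame(
  year = c(2017, 2018),
  cycle_min = c(10, 15),
  record_min = c(5, 5)
)

# Assumption: one file is written per duty cycle.
duty$files_per_hour <- 60 / duty$cycle_min
duty$fraction_recorded <- duty$record_min / duty$cycle_min
duty
```

Under those assumptions an hourly average draws on 6 files in 2017 and 4 files in 2018.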
Plots of high frequency patterns. Notice the diurnal pattern, with the highest SPL at night and the lowest during the day (this is shown in the literature), and the clear splits by site.
Notice that site 35 seems to have switched position between 2017 and 2018, while all of the other sites stay more or less in the same spot.
Plots of mid frequency patterns. Notice the opposite diurnal pattern, with the highest SPL during the day and the lowest at night, and again the clear splits by site.
Also notice that site 35 makes a similar switch in the mid frequency band, going from the bottom in 2017 to the top in 2018.
Preliminary Models
Looking into the relationships between biogenic sounds (Knocks/Calls and Snaps) and the SPL of their respective frequency bands (MF SPL and HF SPL).
Looking at Total Knocks only
SPL MF ~ Tot_Knocks
# Model 1: Total Knocks only (Gamma family, which uses the inverse link by default)
gfit1 <- glm(SPL_Midrange ~ Tot_Knocks, data = AC.DF1, family = Gamma)
summary(gfit1)
##
## Call:
## glm(formula = SPL_Midrange ~ Tot_Knocks, family = Gamma, data = AC.DF1)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -0.068872 -0.021698 -0.008332 0.015069 0.171982
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 9.553e-03 3.461e-05 276.019 < 2e-16 ***
## Tot_Knocks -1.534e-06 3.914e-07 -3.918 0.000123 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for Gamma family taken to be 0.001098715)
##
## Null deviance: 0.22855 on 199 degrees of freedom
## Residual deviance: 0.21186 on 198 degrees of freedom
## AIC: 1068.1
##
## Number of Fisher Scoring iterations: 3
par(mfrow = c(2,2))
plot(gfit1)
summary.glm(gfit1)$coefficients
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 9.553059e-03 3.461011e-05 276.019343 5.079866e-258
## Tot_Knocks -1.533644e-06 3.913970e-07 -3.918385 1.227169e-04
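Since `family = Gamma` uses the inverse link by default, the coefficients above are on the 1/SPL scale, so a negative `Tot_Knocks` estimate corresponds to SPL increasing with knocks. The sketch below converts the printed coefficients back to the SPL scale; `predicted_spl()` is a hypothetical helper, not part of the original script.

```r
# Coefficients copied from the gfit1 output above (inverse link scale).
b0 <- 9.553059e-03
b1 <- -1.533644e-06

# Under the inverse link, mean SPL = 1 / (b0 + b1 * Tot_Knocks).
predicted_spl <- function(knocks) 1 / (b0 + b1 * knocks)

# Predicted mid frequency SPL at 0, 100, and 200 knocks.
predicted_spl(c(0, 100, 200))
```

The prediction at zero knocks is about 104.7, in line with the intercept of the earlier linear model on the same data.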
Looking at Total Knocks and Number of Long Calls
SPL MF ~ Tot_Knocks + Num_L_calls
# Model 2: Total Knocks and number of long calls
gfit2 <- glm(SPL_Midrange ~ Tot_Knocks + Num_L_calls, data = AC.DF1, family = Gamma)
summary(gfit2)
##
## Call:
## glm(formula = SPL_Midrange ~ Tot_Knocks + Num_L_calls, family = Gamma,
## data = AC.DF1)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -0.068874 -0.021712 -0.008334 0.015083 0.171968
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 9.553e-03 4.005e-05 238.518 < 2e-16 ***
## Tot_Knocks -1.533e-06 3.946e-07 -3.886 0.000139 ***
## Num_L_calls 2.677e-08 3.231e-06 0.008 0.993399
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for Gamma family taken to be 0.001104281)
##
## Null deviance: 0.22855 on 199 degrees of freedom
## Residual deviance: 0.21186 on 197 degrees of freedom
## AIC: 1070.1
##
## Number of Fisher Scoring iterations: 3
par(mfrow = c(2,2))
plot(gfit2)
summary.glm(gfit2)$coefficients
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 9.552893e-03 4.005097e-05 238.518388890 1.738362e-244
## Tot_Knocks -1.533301e-06 3.945605e-07 -3.886099544 1.391033e-04
## Num_L_calls 2.676589e-08 3.230984e-06 0.008284128 9.933987e-01
Looking at Total Knocks, Number of Long Calls, and Herbivory
SPL MF ~ Tot_Knocks + Num_L_calls + Num_Herbivory
# Model 3: Total Knocks, number of long calls, and herbivory
gfit3 <- glm(SPL_Midrange ~ Tot_Knocks + Num_L_calls + Num_Herbivory, data = AC.DF1, family = Gamma)
summary(gfit3)
##
## Call:
## glm(formula = SPL_Midrange ~ Tot_Knocks + Num_L_calls + Num_Herbivory,
## family = Gamma, data = AC.DF1)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -0.067662 -0.021756 -0.007807 0.015801 0.173266
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 9.565e-03 4.063e-05 235.409 < 2e-16 ***
## Tot_Knocks -1.539e-06 3.932e-07 -3.915 0.000125 ***
## Num_L_calls 3.309e-07 3.225e-06 0.103 0.918400
## Num_Herbivory -3.975e-06 2.555e-06 -1.556 0.121409
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for Gamma family taken to be 0.00109656)
##
## Null deviance: 0.22855 on 199 degrees of freedom
## Residual deviance: 0.20923 on 196 degrees of freedom
## AIC: 1069.6
##
## Number of Fisher Scoring iterations: 3
par(mfrow = c(2,2))
plot(gfit3)
summary.glm(gfit3)$coefficients
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 9.564541e-03 4.062942e-05 235.4092305 2.341236e-242
## Tot_Knocks -1.539367e-06 3.932317e-07 -3.9146563 1.248754e-04
## Num_L_calls 3.308596e-07 3.225337e-06 0.1025814 9.184001e-01
## Num_Herbivory -3.975108e-06 2.555300e-06 -1.5556324 1.214087e-01
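The three Gamma models are nested, and their reported AIC values (1068.1, 1070.1, 1069.6) are within about two units of each other. One way to compare nested fits side by side is sketched below on simulated data; the data frame `sim` and its values are made up and only reuse the column names from the models above.

```r
set.seed(3)
n <- 200

# Simulated stand-in for AC.DF1 (values are arbitrary).
sim <- data.frame(
  Tot_Knocks = rpois(n, 60),
  Num_L_calls = rpois(n, 5),
  Num_Herbivory = rpois(n, 8)
)
mu <- 104 + 0.02 * sim$Tot_Knocks
sim$SPL_Midrange <- rgamma(n, shape = 900, rate = 900 / mu)

# Nested Gamma GLMs mirroring gfit1, gfit2, and gfit3.
m1 <- glm(SPL_Midrange ~ Tot_Knocks, data = sim, family = Gamma)
m2 <- update(m1, . ~ . + Num_L_calls)
m3 <- update(m2, . ~ . + Num_Herbivory)

AIC(m1, m2, m3)                 # lower is better
anova(m1, m2, m3, test = "F")   # sequential F tests for the added terms
```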
Looking at Snaps and their effect on the HF SPL
SPL HF ~ Snaps
Distributions look normal, so this is a linear model.
fit4 <- lm(SPL_HF ~ Snaps, data = AC.DF1)
summary(fit4)
##
## Call:
## lm(formula = SPL_HF ~ Snaps, data = AC.DF1)
##
## Residuals:
## Min 1Q Median 3Q Max
## -7.4772 -2.6200 -0.4764 2.6614 8.3553
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 91.690030 5.741405 15.970 < 2e-16 ***
## Snaps 0.017654 0.003924 4.499 1.16e-05 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 3.549 on 198 degrees of freedom
## Multiple R-squared: 0.09275, Adjusted R-squared: 0.08817
## F-statistic: 20.24 on 1 and 198 DF, p-value: 1.162e-05
par(mfrow = c(2,2))
plot(fit4)
summary(fit4)$coefficients
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 91.69002981 5.741405123 15.969963 1.943186e-37
## Snaps 0.01765414 0.003923967 4.499054 1.162338e-05
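The normality judgement behind the choice of a linear model can be checked numerically as well as by eye. The sketch below runs a Shapiro-Wilk test and a Q-Q plot on the residuals of a model fitted to simulated data; the simulated values are arbitrary and the check itself is a suggestion, not a step from the original analysis.

```r
set.seed(11)

# Simulated stand-in for the Snaps and SPL_HF columns (values are arbitrary).
sim <- data.frame(Snaps = rnorm(200, mean = 1460, sd = 65))
sim$SPL_HF <- 92 + 0.0177 * sim$Snaps + rnorm(200, sd = 3.5)

fit <- lm(SPL_HF ~ Snaps, data = sim)
res <- residuals(fit)

# Shapiro-Wilk test: a small p-value suggests non-normal residuals.
sw <- shapiro.test(res)
sw

# Visual check: points should fall close to the reference line.
qqnorm(res)
qqline(res)
```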